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Abstract — Portfolio optimization is the task of allocating the investors capital among 
different assets in such a way that the returns are maximized while at the same time, the 
risk is minimized. The traditional model followed for portfolio optimization is the 
Markowitz model [1], [2], [3]. Markowitz model, considering the ideal case of linear 
constraints, can be solved using quadratic programming, however, in real-life scenario, the 
presence of nonlinear constraints such as limits on the number of assets in the portfolio, the 
constraints on budgetary allocation to each asset class, transaction costs and limits to the 
maximum weightage that can be assigned to each asset in the portfolio etc., this problem 
becomes increasingly computationally difficult to solve, ie NP-hard. Hence, soft computing 
based approaches seem best suited for solving such a problem. An attempt has been made in 
this study to use soft computing technique (specifically, Genetic Algorithms), to overcome 
this issue. In this study, Genetic Algorithm (GA) has been used to optimize the parameters 
of the Markowitz model such that overall portfolio returns are maximized with the standard 
deviation of the returns being minimized at the same time. The proposed system is validated 
by testing its ability to generate optimal stock portfolios with high returns and low standard 
deviations with the assets drawn from the stocks traded on the Bombay Stock Exchange 
(BSE). Results show that the proposed system is able to generate much better portfolios 
when compared to the traditional Markowitz model. 

Index Terms — Genetic Algorithm, portfolio optimization, Markowitz, BSE 

I. Introduction 

Portfolio optimization is the process of choosing the proportions of various stocks to be held in an investment 
portfolio, so as to make the portfolio better than any other on the basis of some criterion. The criterion will 
combine, directly or indirectly, considerations of the expected value of the portfolio's rate of return as well as 
of the return's dispersion and possibly other measures of financial risk. 

Traditional Portfolio Optimization techniques use the Markowitz model, which assists in the selection of the 
most efficient portfolio by analyzing various possible portfolios of the given securities. By choosing 
securities that do not "move" exactly together, the Markowitz model shows a method of risk reduction. 
It is also called the mean-variance model because it is based on expected returns (means) and the standard 
deviation (Variance) of the various portfolios. The model construction is based on the following assumptions 

1. Risk of a portfolio is based on variability of the returns from the portfolio. 

2. Analysis is based on single period model of investment. 
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3. An investor either maximizes his portfolio returns for a given level of risk or maximizes his returns for the 
minimum risk. 

To choose the best portfolio from a number of possible portfolios, each with different returns and risk, two 
separate decisions are to be made: 

1. Determination of a set of efficient portfolios. 

2. Selection of the best portfolio from the efficient set. 

Markowitz model, in the ideal case, can be solved with the help of quadratic programming. However, with 
the addition of constraints such as restrictions on the number of assets, transaction costs, limits on the 
budgetary allocation and capital allocation constraints, the task of identifying solutions to the optimization 
problem becomes increasingly difficult. Hence, soft computing based methods have to be relied upon to 
obtain optimal solutions. 

There have been several studies on the application of soft computing techniques to portfolio optimization. In 
[4], a simulated annealing based approach to solving issues in portfolio selection was presented. It was 
observed that the tailoring work required to account for different classes of constraints and fine-tune the 
parameters of the algorithm was rather delicate (especially for trading constraints. In[5], fuzzy set theory and 
GA was used to generate optimal portfolios with assets selected from the Istanbul Stock Exchange, however, 
it is seen that a certain degree of domain expertise is required for generating the fuzzy membership functions 
. In [6], a variant of Hopfield network was used to generate optimal portfolio and the results were compared 
to those obtained using other approaches such as GA, tabu search and simulated annealing. In [7], application 
of GA to generate optimal portfolios was presented. In [8], a survey of applications of different evolutionary 
algorithms for solving portfolio optimization was presented. 

II. Markowitz Model for Portfolio Optimization 

With any set of assets and their corresponding weights and individual returns, we can find the expected return 
by finding the weighted average of the individual returns of all the assets as: 

E(ret p ) =Z£ =1 w k E(ret k ) (1) 

where: 



N 




k=l 

N = the number of assets 

w k = the weight of asset k 

ret k p = the rate of return of asset k in portfolio p 

E() = the expectation function 

Risk associated with a portfolio is typically measured by the variance. The variance of a single asset is the 
expected value of the sum of the squared deviations from the mean. The variance of a portfolio combination 
of assets is equal to the weighted average co variance of the returns on its individual securities: 

Variance(ret p ) - a p = £ ^=1^1=1 w k w i C° Varire^^e^ ) (2) 

Covariance in terms of correlation coefficient is: 

CoVar[ret k ,ret l ) - Pia \ i = °ui 0) 

Where, 

p kl = correlation coefficient between ret k and ret z 

ret i =the rate of return of asset i 

<7j= standard deviation of ret t 

The variance of the portfolio p is now given as: 

Variance(ret p ) - YJk=i^i=i w k w i ki (4) 
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The standard deviation of the portfolio is the square root of the variance. The mean return of every asset is its 
average value in the sample space. The variance is the average of the squared deviations of the sampled 
average and covariance gives the average value of product of deviations. 

Suppose a portfolio p is constructed of two assets Aj and A 2 with their weights in the portfolio being 
represented by w A± and w Az respectively. Also w A± + w Az = 1 . The expected return of the portfolio p will 
be: 

E(ret p ) = w Ai E(ret A J + w Az E(ret Az ) (5) 

The variance of the rate of return on portfolio p is 

Variance(ret p ) = a I = (w Ai <j Ai + w A2 a A J (6) 

which reduces to 

Op = + w l z °l z + 2w a 1 w A2 P A2 a 2 o Ai o A2 (7) 

The correlation between the asset components (p) is inversely proportional to portfolio risk. That is, the lower 
(even negative) the correlation, the lower the portfolio risk. For example let's assume that P a± a 2 = 1- Th en > 
the right-hand side of equation (7) is a perfect square and simplifies to er p = w A± (T Ai + w A2 a A2 
In this special case, the portfolio mean and standard deviations are simple weighted averages of the asset 
returns. 

A plot between the (mean) returns of the portfolio on the y-axis and the standard deviation from the mean 

(also called the risk factor) on the x-axis, was proposed by Markowitz to identify the portfolio with maximum 

returns or minimum risks. All asset combinations, resulting in different possible portfolios, are plotted in this 

risk-return space. The set of portfolios that give the most optimal returns form the efficient frontier. 

For a given appetite of risk the investor has, the portfolio lying on the efficient frontier gives the best 

portfolio the investor can have. The portfolio selection problem can be formulated as a quadratic program as 

follows: 

min w T Qw (8) 

with 

2Li w fc ret fc ^ ret p 

%k=i w k = 1 , 

w fe > Vfe in [1,7V] and, 

Pw < q (additional constraints) 

where, 

Q e R NXN 

qER M 

p e r nxm 

The above formulated quadratic problem is convex and can be solved using quadratic programming. 
However, presence of constraints such as upper and lower bounds on the asset allocation, upper and lower 
limits on budgetary constraints and the variation in the number of assets (AO cause additional constraints and 
require modification of the constraints specified in (8). This results in an increase in complexity of the 
optimization problem and makes it a good candidate for application of soft computing based techniques such 
asGA. 

IE. Genetic Algorithms 

GA is a parallel search algorithm that can be used for optimizing highly nonlinear objective functions. The 
algorithm starts with a set of candidate solutions randomly selected from the specified search space. The 
collection of these candidate solutions is called the population. The best candidate solutions from the 
population are selected based on a measure of fitness. The selected solutions are called the parents. The 
selected parents undergo crossover and mutation to yield a new set of candidate solutions, called children. 
The collection of parents and children together form the mating pool, which form the basis for the next 
'generation' of solutions. The process of selection, crossover and mutation is repeated every generation and 
this results in improvement of the fitness of the population over generations. The GA terminates when either 
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a predefined terminating condition is encountered or when there is no further improvement in the fitness of 
the population over many generations. 

In the present study, the variables which need to be optimized using GA include: the upper and lower bounds 
on capital allocation to an individual asset, the upper and lower bounds on the budget that can be allocated to 
assets with the rest being held in cash, and the number of assets to be selected to make up the portfolio. 



IV. System Specifications 

The GA optimized Markowitz model is validated on the assets drawn from the Bombay Stock Exchange 
(BSE). A total of 30 large market capitalization companies stocks (those belonging to Group-A of the BSE) 
were selected initially. The time frame of the data was from 22/10/2010 to 26/07/2013, collected on daily 
basis (daily closing prices). Three different categories of portfolios were created, with portfolio category 1 
using GA to optimize only the upper and lower bounds on asset allocation, portfolio category 2 using GA to 
optimize the upper, lower bounds on asset allocation and upper, lower bounds on budgetary allocation and 
portfolio category 3 using GA to optimize upper, lower bounds on asset allocation, upper, lower bounds on 
budgetary allocation as well as the number of assets to be considered in the portfolio. Three different proxies 
for the risk free returns, investments in which, is available to the general public, were also considered. 
The system details are given as follows: 

A. Assets 

A total of 30 stocks belonging to group A [9], listed on the Bombay Stock Exchange are considered. Some or 
all of these stocks could constitute the final optimized portfolio. The stocks considered are: TCS, Reliance 
Inds., ITC, ONGC, Coal India, UltraTech Cement, HDFC Bank , Hindustan Unilever, BhartiAirtel, Wipro, 
NTPC, SBI, ICICI Bank, Tata Motors, HCL Technologies, Sun Pharma, Bajaj Auto, Indian Oil, Idea 
Cellular, Kotak Mahindra Bank, Larsen and Tubro, NMDC, Mahindra & Mahindra, Nestle India, Axis Bank 
Ltd., Asian Paints, Hero Moto Corp, CIPLA, Maruti Suzuki India, GAIL(India). 

B. Risk free rate of return 

Comparative study was carried out based on three different proxies for the risk-free rate: 

1. Government of India bonds [10]: returns 8% p. a 

2. Post office time deposit[ll]: 8.5% returns p.a 

3. National Savings Certificate (NSC) IX Issue (10 years)[12]: 8.8% returns p.a 

C. Buying and selling costs 

The buying and selling costs are set to 0.5%. This figure was arrived at from the information available with 
different brokerage houses. There are varying transaction costs for different plans offered by each of these 
houses and there does not appear to be any standardization in existence as far as transaction costs are 
considered. The transaction costs could be as low as 0.1% (subject to conditions) [13] selecting which, 
would have further boosted the returns generated by our optimized portfolios but to err on the side of caution, 
a transaction cost of 0.5% [14] was considered. Also, no short selling is to be allowed. 

D. GA parameters 

The objective function chosen for the purpose of optimization was: 

minimize ( 1/(1 +avg portfolio returns (X)) 

Where, 

X ={ upper bound, lower bound} for portfolio category 1, 

X = {upper bound, lower bound, upper budget, lower budget} for portfolio category 2, and 

X = {upper bound, lower bound, upper budget, lower budget, number of assets} for portfolio category 3. 

Following GA parameters were used for the optimization process: 
Population size: 20 
Max generations: 100 

Selection: Stochastic uniform (involves laying out a line with each parent corresponding to a section of the 
line of length proportional to its scaled value and then making a pointer move along the line in equal sized 
steps. At every step, the parent from the section where the pointer now lies, is selected. 
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Reproduction: The crossover operation is carried out using a binary crossover mask. A random binary vector 
(called the mask) of the same length as the two parents is created. In The positions where l's are seen in the 
mask, the gene is taken from the first parent and the positions in the mask where O's appear, gene is taken 
from the second parent to finally yield the child. 80% of the population in the next generation is created using 
this crossover. Remaining are elite children that are the best fit members of the current population and 
survive to the next generation. 

Mutation: A Gaussian method of mutation is followed where a random number from a Gaussian distribution 
with mean is added to each entry of the parent vector. 



V. Results 

Performance of the proposed system is presented in the form of three tables Table I-ffl. The average daily 
returns produced by the portfolio on the efficient frontier with maximum returns, is presented. Efficient 
frontiers obtained with the use of default constraints and those obtained with the five GA -optimized 
parameters are presented in Fig. 1-3. Parameters which are not optimized (in portfolio categories 1 and 2) are 
set to their ideal valuesie. no upper and lower bounds on asset allocation, no cash holdings at any time and 
the entire set of 30 assets to be considered. Transaction costs are kept at 0.5% for all the three categories 



Table LThree Categories of GA Optimized Portfolios with Risk Free Returns as Goi Bond Returns 



Portfolio category 


GA optimized parameters 


Avg .Daily 
Returns 


Std Deviation of returns 


Lower 
Bound 


Upper 
Bound 


Lower 
Budget 


Upper 
Budget 


Num 

Asset 

s 




0.02 


0.8 








-9.8406e-5 


0.0083 


2 


0.0129 


0.875 


0.3203 






-9.8406e-5 


0.0083 


3 


0.0143 


0.8 


0.01 




2 


0.0011 


0.0142 




Figure 1 . Efficient frontiers for Goi bonds as risk free proxy. Portfolios on the efficient frontier with ideal values are represented using 
'*' and portfolios onthe efficient frontier for GA optimized parameters are represented using 'A' 
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Table II. Three Categories of GA optimized portfolios with risk free returns as post office time deposits returns 



Portfolio category 


GA optimized parameters 


Avg .Daily 
Returns 


StdDeviation of returns 


Lower 
Bound 


Upper 
Bound 


Lower 
Budget 


Upper 
Budget 


Num 

Assets 




0.020 


0.8 








-9.8406e-5 


0.0083 


2 


0.01 


0.8 


0.02368 






-9.8406e-5 


0.0083 


3 


0.01 


0.8 


0.06 




2 


0.0011 


0.0142 




Figure 2. Efficient frontiers for post office time deposits as risk free proxy. Portfolios on the efficient frontier with ideal values are 
represented using '*' and portfolios onthe efficient frontier for GA optimized parameters are represented using 'A' 



Table III.Three Categories of GA optimized portfolios with risk free returns as NSC returns 



Portfolio category 


GA optimized 


parameters 


Avg .Daily 
Returns 


Std deviation of 
returns 


Lower Bound 


Upper 
Bound 


Lower 
Budget 


Upper Num 
Budget Assets 




0.0200 


0.8 






-9.8406e-5 


0.0083 


2 


0.0165 


0.8241 


0.2977 




-9.8406e-5 


0.0083 


3 


0.0158 


0.8333 


0.7112 


1 2 


0.0011 


0.0142 
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Figure 3. Efficient frontiers for post office time deposits as risk free proxy. Portfolios on the efficient frontier with ideal values are 
represented using '*' and portfolios onthe efficient frontier for GA optimized parameters are represented using 'A' 
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VI. Conclusions 



It can be seen from the results that the use of GA to optimize the Markowitz model parameters significantly 
improves the returns of the portfolio. Optimizing the number of assets in the portfolio has a significant 
impact on the returns generated. It was also seen that there is no significant change in the returns generated 
by the three categories of portfolios for the three different proxies of risk free returns. This could be due to 
the fact that the daily returns generated by these proxies are not very different from one another.When 
compared to the returns generated by investing the capital in BSE-Sensex (the representative index for the 
stocks traded on the BSE) using the traditional buy-and-hold strategy over the same time frame of 710 days, 
the returns of the three portfolio categories is presented in Fig. 4. 
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Figure 4 Percentage returns of the three categories of portfolios (1,2 and 3) compared with buy and hold returns (B&H) 

From Figure 4, it is clearly seen that the GA based optimization of the Markowitz model parameters 
significantly improves the returns when compared to the traditional buy-and-hold strategy. 
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